Outline


• Semiconductors: The Evolution of ICs 

- Availability and Technology 

• IC Selection Requirements - three fields of
thought

- Technical - “The Good” 

- Programmatic - “The Bad” 

- Risk/Reliability - “The Ugly” 

• Reliability and Radiation 

• Radiation Perspective - Four methods of 
selecting ICs for space systems 

- Guaranteed hardness 

- Historical ground-based radiation data 

- Historical flight usage 

- Unknown assurance 

• Understanding Risk

- Risk trade space


The Growth in IC Availability


The semiconductor industry has seen an explosion in the
types and complexity of devices that are available over the
last several decades

- The commercial market drives features

• High density (memories)

• High performance (processors)

• Upgrade capability and time-to-market (FPGAs)

• Wireless (RF and mixed signal)

• Long battery life (Low-power CMOS)


[Figure: Integrated Cycling Bib and MP3]


The Changes in Device Technology 


Besides increased availability, many changes have taken 
place in 

- Base technology, 

- Device features, and, 

- Packaging 

The table below highlights a few selected changes 


Feature                            circa 1990        circa 2007

Base technology                    bulk CMOS/NMOS    CMOS with strained Si or SOI
Feature size                       > 2.0 um          65 nm
Memory size - volatile (device)                      1 Gb
Processor speed                                      > 3 GHz
FPGA gates                                           > 1M
Package                                              FCBGA - 1500 balls
Advanced system on a chip (SOC)    Cache memory      > Gbps serial link, SerDes,
features                                             embedded processors,
                                                     embedded memory


Considerations since
the “old days”

- High reliability (and 
radiation tolerant) 
devices 

• Now a very small market 
percentage 

- Commercial “upscreening”

• Increasing in importance

• Measures reliability,
does not enhance it

- System level
performance and risk

• Hardened “systems” not
devices


[Figure: System designer (“SDRAM?” “Flash?”) trying to meet high-resolution
instrument requirements AND long life]



IC Selection Requirements


To begin the discussion, we shall review IC
selection from three distinct and often contrary
perspectives

- Performance, 

- Programmatic, and, 

- Reliability. 

Each of these will be considered in turn, however,


Programmatic Requirements and 
Considerations 


Rationale 

- Trying to keep a program on schedule 
and within budget 

Personnel involved 

- Project manager, resource analyst, 
system scheduler 

Usual method of requirements 

- Flowdown from parent organization 
or mission goals for budget/schedule 

• e.g., launch date

Buzzwords

- Cost cap, Gantt/PERT chart, risk
matrix, contingency


Parent organization makes final
decision


An Example “Ad hoc” Battle 


• Mission requirement: High resolution image 

- Flowdown requirement: 14-bit 100 Msps ADC 

• Usually more detailed requirements, such as ENOB, INL, or DNL,
are used as well

- Designer 

• Searches for available radiation hardened ADCs that 
meet the above requirements 

• Searches for commercial alternatives that could be 
upscreened 

- Manager 

• Trades the cost of buying a Mil-Aero part, which requires less
aftermarket testing, against that of a purely commercial IC

• Worries over delivery and test schedule of the candidate 
devices 

• Evaluates existing device data to determine reliability 
performance and additional test cost and schedule.


Traditional Risk Matrix


Risk Tolerance Boundary 
Placed on the profile to reflect 
Corporate “Risk Appetite” 


By adjusting the level of
currency hedging, resources 
can be released to help fund 
improvements to protection of 
the production facility. 


Caution Zone 
Risks in the “yellow” area
need constant vigilance 
and regular audit 


Impact 


Likelihood Scale: A: Very High B: High C: Occasional D: Low E: Very Low F: Almost Impossible

Impact Scale: I: Catastrophic II: Critical III: Significant IV: Marginal 
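
The two scales above can be combined into a simple lookup. The sketch below is illustrative only: the scale labels come from the slide, but the scoring rule (product of ranks) and the red/yellow/green thresholds are assumptions, and `risk_zone` is a hypothetical helper name.

```python
# Scale labels are taken from the slide. The scoring rule and the zone
# thresholds are illustrative assumptions, not part of the original matrix.
LIKELIHOOD = {"A": "Very High", "B": "High", "C": "Occasional",
              "D": "Low", "E": "Very Low", "F": "Almost Impossible"}
IMPACT = {"I": "Catastrophic", "II": "Critical",
          "III": "Significant", "IV": "Marginal"}


def risk_zone(likelihood: str, impact: str) -> str:
    """Classify a risk as 'red', 'yellow', or 'green' (hypothetical thresholds)."""
    like_rank = 6 - list(LIKELIHOOD).index(likelihood)   # A -> 6 ... F -> 1
    impact_rank = 4 - list(IMPACT).index(impact)         # I -> 4 ... IV -> 1
    score = like_rank * impact_rank                      # ranges from 1 to 24
    if score >= 12:
        return "red"      # assumed to lie beyond the risk tolerance boundary
    if score >= 6:
        return "yellow"   # assumed caution zone
    return "green"


if __name__ == "__main__":
    for like, imp in [("A", "I"), ("C", "III"), ("F", "IV")]:
        print(like, imp, risk_zone(like, imp))
```

A real program would place the boundary according to its own risk appetite rather than these fixed numbers.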


Reliability “versus” Radiation 
Basic Electronics Reliability


Reliability of electronics is viewed traditionally using a 
“bathtub” curve view of mean time to failure (MTTF) 

- This looks at both intrinsic (wearout) and extrinsic (infant 
mortality) failure modes 
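
One common way to picture the bathtub shape is as the sum of three hazard terms: a decreasing Weibull term (infant mortality), a constant term (useful life), and an increasing Weibull term (wearout). The sketch below assumes that decomposition. All parameter values are hypothetical and chosen only to produce the shape, not taken from any device data.

```python
def weibull_hazard(t: float, shape: float, scale: float) -> float:
    """Weibull hazard rate h(t) = (k / s) * (t / s) ** (k - 1), valid for t > 0."""
    return (shape / scale) * (t / scale) ** (shape - 1.0)


def bathtub_failure_rate(t: float,
                         infant=(0.5, 1000.0),     # shape < 1: decreasing rate
                         random_rate: float = 1e-4,  # constant useful-life rate
                         wearout=(5.0, 10000.0)    # shape > 1: increasing rate
                         ) -> float:
    """Hypothetical failure rate (per hour) at time t (hours), t > 0."""
    return (weibull_hazard(t, *infant)
            + random_rate
            + weibull_hazard(t, *wearout))


if __name__ == "__main__":
    for hours in (10.0, 2000.0, 20000.0):
        print(hours, bathtub_failure_rate(hours))
```

With these assumed parameters the rate is high early, lowest in mid-life, and rises again late in life.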


[Figure: The Bathtub Curve - hypothetical failure rate versus time.
Regions: Infant Mortality (decreasing failure rate); Normal Life / Useful Life
(low "constant" failure rate); End of Life Wear-Out (increasing failure rate)]


Radiation Effects and Reliability




Radiation reliability is viewed differently than a 
normal reliability (bathtub) consideration 
- It is a mix of a MTTF (or Time to First Failure - TTFF) 
condition known as Total Ionizing Dose (TID) or 
Displacement Damage (DD) and a Mean Time Between 
Failures (MTBF) condition known as Single Event 
Effects (SEE). 



Radiation and Traditional IC Reliability
Are the two related?


The short answer is yes

- Radiation MTTF conditions can accelerate reliability
wearout mechanisms either by eroding electrical signal
margins or through material damage

- Radiation MTBF conditions also can impact long-term 
reliability 

• A single energetic particle, for example, can cause device 
failure instantaneously (such as with a gate rupture) or at a 
later time due to material damage. 

The methods of coupling the radiation-induced impacts 
into reliability calculations are limited 

- NEPP Program has a new effort to explore this coupling on 
state-of-the-art commercial Flash memories that are highly


Typical Radiation Hardness Assurance (RHA) Flow

[Figure: RHA flow for a flight program. Blocks: Environment Definition
(external environment; environment in the presence of the spacecraft);
Project Requirements and Specifications (technology hardness; design margins);
Design Evaluation (parts list screening; radiation characterizations,
instrument calibration, and performance predictions);
In-Flight Evaluation (technology performance; anomaly resolution)]




Radiation Perspective on IC Selection

From the radiation perspective, ICs can be viewed 
as one of four categories. 

- Guaranteed hardness 

• Radiation-hardened by process (RHBP) 

• Radiation-hardened by design (RHBD) 

- Historical ground-based radiation data 

• Lot acceptance criteria 

- Historical flight usage 

• Statistical significance 

- Unknown assurance 

• New device or one with no data or guarantee


“Guaranteed” Radiation Tolerance

A limited number of semiconductor manufacturers, 
either with fabs or fabless, will guarantee radiation 
performance of devices 

- Examples: 

• ATMEL, Honeywell, BAE Systems, Aeroflex 

- Radiation qualification usually is performed on either 

• Qualification test vehicle, 

• Device type or family member, or 

• Lot qualification 

- Some vendors sell “guaranteed” radiation tolerant devices 
by “cherry-picking” commercial devices coupled with 
mitigation approaches external to the die 

The devices themselves can be hardened via

- Process or material (RHBP or RHBM),

- Design (RHBD), or

- Serendipity (RHBS)





Archival Radiation Performance -
Ground-based Data

Reviewing existing ground radiation test data on an IC and
its application has been discussed previously

- For example, Christian Poivey at NSREC Short Course in 2002

- Using a “similar” device with data is risky, but sometimes
considered (though not recommended)

In general, the flow is shown below

[Figure: decision flow for archival data. Decision nodes include: Does data
exist? Same wafer lot? Has process/foundry changed? Test method applicable?
Sufficient test data? Outcome: Data usable]


Archival Radiation Performance -
Flight Heritage

[Figure caption: Some heritage designs last better than others]

Can we make use of parts with
flight heritage and no ground
data for a new mission?

A flow similar to that for archival
ground data exists, but consider
as well

- Statistical significance of the 
flight data 

• Environment severity? 

• Number of samples? 

• Length of mission? 

- Has storage of devices affected 
radiation tolerance or reliability? 

- And so forth 
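
To give a feel for why sample count and mission length matter, the sketch below computes the standard zero-failure upper confidence bound on a constant failure rate, lambda_upper = -ln(1 - CL) / T, where T is the accumulated device-hours. It assumes a constant failure rate, no observed failures, and a heritage environment at least as severe as the new mission. None of these assumptions is stated in the slides, and `failure_rate_upper_bound` is a hypothetical helper name.

```python
import math


def failure_rate_upper_bound(n_devices: int,
                             mission_hours: float,
                             confidence: float = 0.90) -> float:
    """Upper bound on failure rate (per device-hour) after zero observed failures.

    Assumes a constant failure rate (exponential lifetimes); illustrative only.
    """
    total_device_hours = n_devices * mission_hours
    return -math.log(1.0 - confidence) / total_device_hours


if __name__ == "__main__":
    # Hypothetical heritage: 2 devices flown for 3 years without failure.
    hours = 3 * 365 * 24
    print(failure_rate_upper_bound(2, hours))
```

Few samples or a short mission leave a loose bound, which is one way to see why flight heritage alone carries limited statistical weight.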

This approach is rarely
recommended by the radiation


ICs with No Guarantee or Heritage




• Radiation testing is required in 
the vast majority of cases 



[Figure: SDRAM mounted on a daughtercard]


- Testing complexities and 
challenges are discussed 
elsewhere (e.g., Swift during this 
short course, LaBel during the 
conference) 

- The true challenge is to gather 
sufficient data in a cost and 
schedule effective manner. 

• A backup plan should be made in
case the device fails to pass radiation
criteria.

• Reliability testing has similar
concerns


[Figure: FPGA-based motherboard]


Is Testing Always Required?


Exceptions for testing may include 

- Operational 

• Ex., The device is only powered on once per orbit and the 
sensitive time window for a single event effect is minimal 

- Acceptable data loss 

• Ex., System level error rate may be set such that data is 
gathered 95% of the time. This is data availability. Given 
physical device volume and assuming every ion causes 
an upset, this worst-case rate may be tractable. 

- Negligible effect 

• Ex., A 2 week mission on a shuttle may have a very low 
TID requirement. TID testing could be waived. 
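
The "acceptable data loss" exception can be checked with simple arithmetic. The sketch below assumes the worst case described above (every ion crossing the device causes an upset) and approximates the exposure by ion flux times die area. The flux, area, and recovery-time values are hypothetical, as are the function names.

```python
SECONDS_PER_DAY = 86400.0


def worst_case_upset_rate(ion_flux_per_cm2_day: float, die_area_cm2: float) -> float:
    """Upsets per day if every ion crossing the die causes an upset."""
    return ion_flux_per_cm2_day * die_area_cm2


def data_availability(upsets_per_day: float, recovery_seconds: float) -> float:
    """Fraction of time data is gathered, assuming each upset costs a fixed outage."""
    lost_seconds = upsets_per_day * recovery_seconds
    return max(0.0, 1.0 - lost_seconds / SECONDS_PER_DAY)


if __name__ == "__main__":
    # Hypothetical numbers: 100 ions/cm^2/day on a 1 cm^2 die, 30 s recovery.
    rate = worst_case_upset_rate(100.0, 1.0)
    availability = data_availability(rate, 30.0)
    print(rate, availability, availability >= 0.95)
```

If even this pessimistic rate meets the availability target, testing to refine the rate may add little.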






Understanding Risk 


Risk for a mission falls into the same topic areas as parts
selection

- Technical, programmatic, and reliability

Technical risks

- Relate to the circuit designs not being able to meet mission 
criteria such as jitter related to a long dwell time of a telescope 
on an object 

Programmatic risks 

- Relate to a mission missing a launch window or exceeding a 
budgetary cost cap which can lead to mission cancellation 

Reliability risks 

- Relate to the mission meeting its lifetime and performance goals
without premature failures or unexpected anomalies


The Risk Trade Space -
Considerations for Device Selection (Incomplete)


Cost and Schedule

- Procurement

- NRE

- Maintenance

- Qualification and test

Performance

- Bandwidth/density

- SWaP

- System function and
criticality

- Other mission constraints


Design Environment and
Tools

- Existing infrastructure and
heritage

- Simulation tools

System operating factors

- Operate-through for single
events

- Survival-through for portions
of the natural environment

- Data operation (example,
95% data coverage)

Radiation and Reliability

- SEE rates

- Lifetime (TID, thermal,
reliability, ...)


The systems …r is often the “person in the
middle” evalu… working with … risk levels




Systems Engineering and Risk 

• The determination of acceptability for 
device usage is a complex trade space 

- There is often more than one answer that’s 
acceptable 

• A more omnidirectional approach is taken 
to evaluate the various risks 

- Each of the three factors may be assigned 
weighted priorities 
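
A weighted-priority evaluation can be sketched as a normalized weighted sum over the three factors. The weights, the 1-to-5 scoring scale, and the candidate names below are hypothetical; a real project would set them through its own systems engineering process.

```python
def weighted_score(scores: dict, weights: dict) -> float:
    """Weighted average of factor scores; weights need not sum to 1."""
    total_weight = sum(weights.values())
    return sum(weights[factor] * scores[factor] for factor in weights) / total_weight


if __name__ == "__main__":
    # Hypothetical mission priorities (higher weight = more important).
    weights = {"technical": 0.5, "programmatic": 0.3, "reliability": 0.2}
    # Hypothetical candidate devices scored 1 (poor) to 5 (good) per factor.
    candidates = {
        "rad-hard part": {"technical": 3, "programmatic": 2, "reliability": 5},
        "upscreened commercial part": {"technical": 5, "programmatic": 4, "reliability": 2},
    }
    for name, scores in candidates.items():
        print(name, round(weighted_score(scores, weights), 2))
```

Because missions weight the factors differently, the same candidates can rank differently from one mission to the next, which is consistent with there being more than one acceptable answer.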
Example: Considerations for Selecting a
“Custom” Device

Three basic device type options 

- Custom ASIC (CA) 

• Also called standard cell or just ASIC 

• Essentially a fully-custom design IC

- Structured ASIC (SA) 

• Newer device that is a semi-custom design using built-in 
functional blocks that are interconnected for a user’s design 
by the device manufacturer 

- Field Programmable Gate Array (FPGA) 

• One-time or reprogrammable interconnecting of logic
performed by the user, or done in-circuit
(reprogrammable options)

These classes of devices now have so much
functionality that they can be classified as system
on a chip (SOC)


Sample System Implementation for 
the Three Styles of FPGAs 



[Figure: system implementations arranged by increasing system complexity.
Labels: One-time Programmable (OTP) FPGA; Non-volatile Reprogrammable (Flash)
FPGA; FPGA; circuits to interface between FPGA and ground for new
configuration uploads]


The Trade Space Curve for Performance




• Application-specific trade-offs are often made based on the 
design complexity of the application requirements. 


- Note: FPGAs and SAs are moving to the right as
semiconductor processes scale to smaller transistor feature

[Figure: trade space curve versus design complexity (Low Complexity,
Mid-Complexity, High Complexity, “Science Projects”). Region labels:
“Structured ASICs”, “High Cost, Low NRE”;
“Low Predictability, Low Reliability, High Cost, High NRE”;
“Majority of NASA needs”;
“High Predictability, High Reliability, Low Cost, Medium NRE”.
Source: http://www.fpgajournal.com]

Programmatics: 
Development Schedule 




• Example of time to market (TTM) for commercial options

- Assumptions

• 90 nm technology, < 1 Mgate device

- SA is ~1/2 the TTM of a CA


- FPGAs are likely < 1/2 of the SA TTM

- Note: This does not consider reliability or radiation qualification 
times 
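
The ratios above reduce to simple arithmetic. The sketch below assumes the FPGA figure reads as less than one half of the SA TTM, and uses a hypothetical 24-month custom ASIC baseline. As the note says, qualification time is not included.

```python
def ttm_estimates(ca_ttm_months: float) -> dict:
    """Rough TTM figures derived from a custom ASIC (CA) baseline.

    SA is taken as about half the CA TTM, and the FPGA value is an upper
    bound equal to half the SA TTM. Reliability and radiation qualification
    time is excluded.
    """
    sa_ttm = 0.5 * ca_ttm_months
    fpga_upper_bound = 0.5 * sa_ttm
    return {"CA": ca_ttm_months, "SA": sa_ttm, "FPGA_upper_bound": fpga_upper_bound}


if __name__ == "__main__":
    # Hypothetical baseline of 24 months for a custom ASIC.
    print(ttm_estimates(24.0))
```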



[Figure: Comparing SA vs. CA for TTM]


Programmatics:

What if the first design is incorrect?

As digital devices pack more bits into the same physical
space (i.e., technology feature size is shrinking), more
designs require a “second pass” or re-spin.

Even worse, the time it takes for this re-spin has increased

[Figure: share of designs requiring re-spin and average re-spin time
(months), by technology]


Programmatics: Development Cost


[Figure: development cost (axis ticks from $2M to $10M) by year and process
node, 1995 to 2003; annotations: 100%, 62%, 13%]

Graphic from MIPS Technologies “The Coming Reality for SOC Designers” by John


Selection Criteria: Radiation and Reliability

While there is no “generic” answer for radiation 
tolerance and reliability levels in the trade, there 
are numerous considerations such as 

- What is known of the process radiation tolerance? 

• Was process/device radiation qualification sufficient for 
planned design or application? 

- Library? Cells? Speed? Etc... 

• Is the process reliable and achieving good yield?

- High-volume commercial fab vs. low-volume niche fab? 

• Are there lot-specific or application-specific concerns? 

- Is there a specific performance requirement such as
reconfigurability or ultra-low power that forces a
non-radiation-tolerant device selection?

• How does a fault-tolerant system architecture enter the
equation?

- How testable is the design/device? 






Current Radiation Hardening Considerations 


Category: Generic CMOS Technology
- RH Availability: Poor
- TID: RHBS; improving with scaling
- SEE: RHBS - not; increasing sensitivity; SOI?

Category: CA
- RH Availability: Niche vendors for RH; commercial would rely on RHBS and RHBD
- TID: RHBS, RHBD, RHBP
- SEE: RHBS, RHBD, RHBP (substrate)

Category: SA
- RH Availability: Niche vendors - none currently available, but several vendors are developing
- TID: RHBS, RHBP, partial RHBD
- SEE: RHBS, partial RHBD, RHBP (substrate)

Category: FPGA
- RH Availability: Limited, but new developments underway
- TID: RHBS
- SEE: RHBS

Notes

- RH levels can be “tuned” via RHBD or RHBP; difficult for radiation testing
- Circuit tiles can incorporate mitigation options
- IP: limited RH IP available; commercial IP?
- Commercial manufacturers working soft error rate (SER) issues
- External and internal SEE mitigation (TMR, scrubbing, reconfiguration, etc.) for logic and


Related SOC Radiation and Reliability
Considerations


New Silicon
-90 nm CMOS
-new materials


New Architectures
-new interconnects
-new power distribution
-new frequencies


New Design Flows/Tools
-programming algorithms, application
-design rules, tools, simulation, layout


New Package
-inspection
-lead free


New Workmanship
-inspection, lead free
-stacking, double-sided
-signal integrity


New Connectors
-higher-speed, lower noise
-serial/parallel


New Board Material
-thermal coefficients
-material interfaces

[Figure annotation, partially legible: circuitry (power, memory, IO, etc.);
system reliability and radiation]

Summary of Generalized Features -
ASICs versus FPGAs


                         SRAM-FPGA   OTP FPGA   SA    CA

NRE                      Low         Low        Med   High
Production Cost          High        High       Med   High
Risk                     Low         Low        Med   High
Development Span (TTM)   Low         Low        Med   High
Electrical Performance   Low         Low        Med   High
Density/Capacity         Low         Low        Med   High
Power Consumption        High        Med        Med   Low
Flexibility              High        Med        Med   Low
Radiation Performance    Low                    Med   High




To be presented by Kenneth LaBel at NASA Quality Leadership Forum (QLF), March 5-6, 2008, Cape 
Canaveral, FL. 






Conclusions 



• In this talk, we have presented considerations for selection 
of ICs for space systems 

- Technical, programmatic, and risk-oriented 

• As noted, every mission may view the relative priorities between 
the considerations differently 

• We have also noted a specific type of example, that of 
custom to semi-custom devices 

• As seen below, every decision type may have a process.